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ABSTRACT 

The paper deals with the estimation of survival function with the use of linear hazard function and exponential 
base line distribution. Considering cox PH model with exponential base line distribution, the maximum likelihood 
estimator of general linear parametric function of regressors and frailty parameter has been obtained by taking very general 
form of the regression matrix and estimator being biased its MSE is obtained. Further the same has been estimated by least 
square theory using Gauss-Markoff model and it has been compared with usual estimator. The results have been applied by 
taking kidney infection data. 
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1. INTRODUCTION 

The Cox proportional hazard model is extensively used in medical research as Survivor model. Cox proportional 
hazard model uses regression analysis for censored data. In Cox proportional hazard model the explanatory variables or co- 
variates are studied for the random effect of covariates on distribution of survivor times. Unfortunately we are unable to 
include the relevant covariates related to diseases in many cases due to unawareness of the factor. 

When random effects (frailties) are included, the failure time distribution obtained by integration over the 
distribution of frailties loses this simple cancellation property. For such failure time distribution Hougaard (1986a, 1986b), 
Prentice, Williams and Peterson (1981) have introduced some models. The Maximum Likelihood Estimation (MLE) 
method and the Best Linear Unbiased Predictor (BLUP) method have been used by Handerson (1975) and McGilchrist and 
Aisbett(1991). 

Particularly useful family of baseline model is obtained from a univariate lifetime model with hazard function 
by defining 

k(t/x) = fe 0 (t> ^ 

where X= (x ll X 2r ...,X p ) and£= (fi lr P 2 , 

and by including the baseline distribution of Z , the hazard function will be 

h(t/x) = z-.h a (t)e —‘ — 
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In this article we consider the baseline distribution of z as exponential distribution defined as 

/(O = e _ Vi, t > 0, e > o 

Let T be the random variable associated with the survivor function, 5 ( t j . One would like to examine the effect 
of particular regressor variables in relation to the proportional hazard assumption such as Y = logT = K_ ft + taking 

’’Ff? = e.j, being error term which corresponds to shared frailty model which relates to the covariates in addition to 
regressor covariates X_ . Since the exponential distribution has hazard function constant and whenever hazard function 
turns out to be constant, the base line distribution is exponential distribution and therefore we consider the base line 
distribution of each g- as 

f(e t )=Q 1 e ®, e s >0,0>0, t= 1,2, 

with 0 and Kar(e f ) = @ 2 , Cov(fi ( ,e-) = 0 ,for i =£ j, i = 1,2, ...n r j = 1,2,3, ...n 

So the model for X turns out as 

Y = (1.1) 

where Fis nxlvector, (F 1; ... 5^}, X is nxp matrix {X^, i = 1,2, = 1,2, ...p) of known 

constants, ft is px 1 vector of parameters ft lf ft 2 r ■■■ ftp and £ = is error vector responsible for frailty 

model. We assume that the distribution of each component S t of £ is to be referred as baseline exponential distribution. 

In this article we use Maximum Likelihood (ML) and Least square methods for estimating linear function of 
regression coefficients and frailty parameter such as 

A = C_ ft + S 6 

where C_ = (' C-p Cn, ... ,C\,J and (3 are known, which relates to the frailty and shared frailty parameters ft and 8 

in section 2. By using Gauss-Marcoff model in Cox PH model the estimation of regression and frailty parameter by taking 
exponential base line distribution has been dealt with in section 3. Further both methods have been illustrated by taking 
real data obtained by McGilchrist and Aisbett (1991) on kidney infection in section 4. 

2. ESTIMATION OF A BY MAXIMUM LIKELIHOOD METHOD. 

In Cox PH model, let individual frailty variable e t follow exponential distribution with p.d.f. as 

/(e r )= 0 e 8 , i? ; >0 , 0 > 0, i = 1,2, ...,n (2.1) 

with assumptions given in section 1 . 
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Let Y t = logT t where T t is the time recorded for the i ^ individual and Z follows negative exponential 

!¥■ '(3 

distribution with Cox PH model, having 

where = (X i±r X iSr ... r X t )as 



f Yi {y) = Q.exp 






y>ZiP,e > 0 



Now considering £ = [e lt e 2 , as parameters of n-variate negative exponential distribution the model 

given in (1.1) is 

Y=Xf?+e. 



Using the mean and variance of the exponential distribution of i? ■ , we have 
E(e) = where 1 is n-vector of unity. 

E{Y) = XP +01 and £>(S) = D(Y) = 0 2 i 

Let Y lr i 2 . ,Y n be a random sample on Y and let be the corresponding ordered random 

sample. Thus the likelihood L(p, 0) of 1^., Y z , ... , at y lf y 2 , ... , y n being 

W, ») = e-"exp t-2f=i (y, - 

= 0 -n eKp {— l (y — XP}/9) 

the m.l.e. of p of p is given by 

Xp=lY (l y 
i.e. XXp_ = X 

- if \xx\ * 0 

=*P = \, ■■ , f | i- | (2.2) 

- {{X X)~X if \xx\ = 0. 

Further, 

dlogL({?,8') 

=0 

d8 

leads to the m.l.e. 9 of 8 as 
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6 = Y - Y ( 



CD’ 



(2.3) 



where 



Y = IT= i- 

1 ^ 72 

For given p-vector C and a given constant <3, the m.l.e. A of A = C' /? + 36 is 



A — 

where 



(2.4) 



a = 



-S, if\X’X\^0 



(2.5) 



if\X'X\ = 0 

where is ge 118 rliSBd invsrse of matrix X' X in the sense of A A A — A. 

If C belongs to the row space of X 1 X and 1 belongs to the column space of X 1 X then A is obviously unique 
and unbiased. 

By using 

e(¥) = 9 +X t ‘P andE(Y {l j) = j^+X-f 

A= c'p + fiB 

= (C^X X)-^ 1 - S)Y(1) + SY 

E( &)= [C'(X'^ _1 X14r C'P 

£ 

= (a + rt-5) — h C 0 where a is defined in (2.5) 

re — 

A is unbiased for A if C belongs to the row space of X‘ X and 1 belongs to the column space of X 1 X. 

Further, also A is unbiased for A if 

= S, if \X'X\ = 0 

\c\X , X)~X'l= S, if \x’x\ = 0 

otherwise it is biased and the bias is 

Bias = [c r g'r) _1 x'i — 5] J 

a 6 
n 



( 2 . 6 ) 
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If A is unbiased under condition (2.6), the variance of the m.l.e. A of A is 

Van A) = E{A - E(A}f 

= a 2 Var(Y t0 ) + 5 2 Var(y) + 2aSCov(Y ClJ ,Y) 



a 2 0 2; S 2 0 z SaSO 2 

i 1 where a is defined in (2.2.5) 

n n n 

0° 



and the estimated VciT( A) is 

(Var(A)) = {C'iX'xy^'X) 2 - 

Also the estimated standard error is 

s, e, (A) = Jvar(Aj 



(2.7) 



( 2 . 8 ) 



The variance of the unbiased estimator of A under the conditions of estimability either given in (2.6) or C belongs 

[ i & z & z 

to the row space of XX and belongs to the column space of A X, is , which does not involve /?. We note that for 

■n — 



fixed C, S , n, p the minimization of Vclt(A) is equivalent to the minimization of 
^ holds. 

When the m.l.e., A of A is not unbiased estimator of A, the Mean Square Error(MSE) of A is 
MSE(, A) = E{A-A) 2 

= E{A- EA+EA- A) 2 

= E{ A- EA"f-\-(EA - A) 2 + 2E(A - Eb)(EA - A) 

a 2 0 2 3 2 e 2 , 2aS0 2 „ aS- 2 

= TT+ — + ~ + ( T ) 



s 2 e z 



when 



« 



(2.9) 



If — - > 0, the unbiased estimator of A is better than MSE estimator of A. 



•j jj 

As 9 > 0 r n > 0, the condition — (> 0) reduces to a(> 0) 
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i.e. > S (2.10) 

Generally, C ' (X 1 X) 1 X 1 1 is less than 1 and 5 is generally taken as 1. In general MSE estimator is better than unbiased 
estimator when Cox PH frailty model is used. 

3. ESTIMATION OF A BY LEAST SQUARE METHOD OF SHARED EXPONENTIAL FRAILTY 
MODEL 

Since the p.d.f f(t t ) of survivor time T t of i ttz individual is expressed in shared frailty model as 

/(£*) =h 0 (t i )e^ i+Wi ^ } (3.i) 



where is shared frailty variable and is hazard function of exponential distribution which is 

constant, H and the p.d.f. of n variates Zy, T 2 , T w will be 

£=nr=i/ct [ )=e tEsii?i+EwilFi) 

Taking logarithm, we get 

y= 2 

= I XiPt + Z 

= x'/? + w'T (3.2) 

and denoting W 1 'T by £ and taking n-vector F as n-variate frailty exponential distribution the model is 



Y=Xfi+ e 



which is defined in (1.1). 

Now reconstructing the model (1.1) as 

F=Jltf?+8l+e* (3.3) 

where £' = £ - 01 such that fT(£') = 0 and D(£' ) = 9 ~l 

Let 8) = (y - Xp - 9l)' (Y - Xp - 0l j 

then least square method demands to solve the following equations 







and 
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mm 

89 

Now 



= 0 



= 0 =>X’X0 = X'Y- JT'ei 



= 0=> l'JKfl+l' W = I F 

&& — 



From (3.4) we have 0 in terms of 9 as 

_ [(z'rr^'F - (x'fy-^x'ie if|x'jf| = o 
yJ ’ = {(X'xyx'Y- (X'X)~X‘ld if |X'Z| = 0 

= M(F - 19 ) 



where M = (X' X) _1 X r , taking \X’X\ d= Q 



Substituting 0 in (3.5), we get 

i ' x [ m(f - 10 )] + 1 ' ie=i‘y 

=> 1 'XMY- 1'XMie + 1'10 = i'F 
=> j[l f 1 - i'X'Ml] 9 = I'r - 1'XJVfF 



=>!'[/- XMjie = l'[F— XM\Y_ 



(3.4) 

(3.5) 



(3.6) 



=> r{i - x(x’x}~ i -x’]i 6 = i'[/- 

=> l r Aie = I'AY, where A = I- X{X r X)~ 1 X l and is 



idempotent matrix. 



=> § = 



l 'ay 
i f di 



Using (3.7) in (3.6) 







(X'X)~X'Y_- (X’X}~X'l 



1 AY 



if U'AT| = 0 

if \X l X\ = 0 



(3.7) 



(3.8) 



Also, 

D( 0 )= D(_rxy 1 x'(Y- 01 ) 
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= (X* d(y - 0l)^(X ( X) _1 

= <Jf'Z) _1 0 2 (3.9) 

REMARK (3.1) 

We note that (X' X) ^X‘Y_ is the usual regression estimator of {3 . Due to shared frailty, the estimator is reduced 
l! AY 

by the quantity (^X'Xj 1 X f l ~ ~ which shows that the frailty has negative effect in the estimation of primary 
covariates. 

Particularly, when & = Y, the shared frailty is neglected and in this case the regression frailty is usual, regression 
estimator, 0 = ( A ' A j 1 A ' F , and hence no new covariates are necessary to explain hazard rate or survivor function. This 
is due to exponential base line distribution. 

4. APPLICATION TO KIDNEY INFECTION PATIENTS. 

For illustration we use the kidney infection data given by McGilchrist and Aisbett (1991) of 38 kidney Patients 
using portable dialysis. A catheter is inserted to each admitted kidney Patient and keeps it until infection is removed, that 
is infection is cleared. After some time again catheter is inserted and removed when infection occurs due to kidney or some 
other reasons. Time (in number of days) recorded when catheter is reinserted. The purpose is to note the incidence rate of 
infection to risk variables (Age,Sex and disease type of kidney disorder, Glomerulo Neptiritis(GN), Acute Neptiritis(AN) 
and Polycyatic Neptiritis(PKD). The time interval recorded is considered as random variable and according to Cox PH 
model we have illustrated the estimation by MLE, using the best linear unbiased prediction (BLUP). Estimation of the 
problem is described in McGilchrist and Aisbett (1991). We have obtained maximum likelihood estimator (M.L.E.) of 
main covariates Age(Xj), Sex(X 2 ) along with other correlated covariates like GN(X 3 ), AN(X 4 ) and PKD(X 5 ). Also BLUP 
of regression coefficient, 01,02 r 03> 0a and 8 with its linear function A = /? + 38 is obtained. Not only by MLE 

Method linked with BLUP but obtained independently using linear model. 

We give below the estimates of regression coefficients, 0i.02>02> /-U.. and 8 on the basis of the data given in 
McGilchrist and Aisbett (1991) by taking time variate T of recording time when catheter is inserted first time and Y is log 
T. The maximum likelihood estimates are obtained by using (2.2), (2.3) and (2.8). where X 38x5 is a matrix representing 38 
patients and 5 covariates X b X 2 , X 3 , X 4 , X 5 and =111111 (Fj , i = 1,2,, ...,38), F; = log Fj 

4.1. APPLICATION OF (2.2), (2.3) AND (2.8) TO GET M.L.E. OF 0 S AND 8 

Following Table 4.1 gives the estimates of 0$, /? s by M.L. method and their standard errors. 



Table 4.1 



Variable 


Age 


Sex 


GN 


AN 


PKD 


Regression coefficient estimate 


0.011535 


0.228179 


0.002059 


-0.08724 


0.044094 


Standard error 


0.008532 


0.168784 


0.001523 


0.064528 


0.032616 
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The estimate of the variance is 0.021587. In general the effect of the prior distribution on frailty is to shrink the 
estimate towards the origin. The only regression coefficients i.e. significantly large compared to its standard error is that of 
the age variable, indicating a lower infection rate for the lower age of the patients. 

e = Y -Y w = 3.160408 

The value of frailty parameter is 

Inf (?) = ln(3. 160408) = 1.150701 

We note that taking C 1 = C 2 = C 3 = C 4 = C & = 1 and 5=1 that is, 
C 1 0 Hr S8 = /? s 4- 8 , the MSE of C l 0 + $8 turns out as 1.349332 and the condition given in 

(2.10) is not justified and hence MSE estimator is better than unbiased estimator of frailty parameters. 

Particular Cases 



S.N. 


Cases 


1 


c 2 




C 4 


Cb 


A 


s.e(A) 


1 


1 


0 


0 


0 


0 


1.162236 


0.008532 


2 


0 


1 


0 


0 


0 


1.37888 


0.168784 


3 


1 


1 


1 


1 


1 


1.349332 


0.146927 


4 


1 


1 


0 


0 


0 


1.390415 


0.177316 



This concludes the combined effect of age and sex on the estimate being significantly large. Age and sex has 
lower infection rate. Thus the primary covariates are much more significant rather than other covariates. 

4.2. APPLICATION OF (3.8) AND (3.9) TO GET BLUP OF ^ , § z , P 4 , P 5 AND 8 . 



Using the result (3.8) and (3.9) the usual estimates of VvMvMi and their s.e. are presented in table 4.2. 
Also the BLUP estimates of pj, Pa* P 41 Ps with their s.e. are presented in table 4.3. 

.- V AY 

Here, we have , 8 = ~ = 3.60439 



The value of frailty parameter is ln( 8) = ln(3. 60439) = 1.282152 



Table 4.2: Usual Estimates (Without Frailty) of p z p a , p + , with s.e 



Variable 


Age(pj) 


Sex(p 2 ) 


GN(p 3 ) 


AN(P 4 ) 


pkd(P e ) 


Regression coefficient estimate 


0.049381 


2.68169 


-0.4497 


-1.4122 


0.364796 


Standard error 


0.014662 


0.520854 


0.768717 


0.759195 


0.979725 



Table 4.3: Regression Estimates (With Frailty) of |3 1 ,p 2 j p 3 ., f> 4 , (3 5 with s.e 



Variable 


Age(pj) 


Sex(P 2 ) 


GN(p 3 ) 


AN(p 4 ) 


PKD(p E ) 


Regression coefficient estimate 


0.028043 


2.259586 


-0.45351 


-1.25082 


0.283228 


Standard error 


0.014662 


0.520854 


0.768717 


0.759195 


0.979725 
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From above two tables (4.2) and (4.3) we observe that the estimates of frailty parameters are smaller than those of 
usual BLUP values of Pi* Ps* Pe* P 4 ' Ps even though their estimates of s.e. are same. 

CONCLUSIONS 

Theoretically it is observed that the m.l.e. of primary covariates when frailty covariates are present in terms of 
hazard function parameter with cox PH model are more advisable to use rather than usual BLUP estimator and this is 
supported by data on kidney infection. 
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